Variable UB-Trees: An efficient way to accelerate OLAP queries1
نویسندگان
چکیده
Pre-computation, clustering and indexing are common techniques to speed up query processing. Precomputation results in the best query response time at the expense of load performance and secondary storage space. For data warehousing (DW) applications, pre-computation is mostly discussed for aggregation operations [CD97]. Indexing is used to efficiently process a query if the result set defined by the query restrictions is fairly small. Favoring retrieval response time over update response time allows to build several indexes on one table or data cube of a data warehouse. Bitmap indexes are widely discussed as an improvement over B-Trees for DW applications, since they efficiently evaluate queries with multi-attribute restrictions. However, the overall result set still must be relatively small. This is a major drawback of bitmap indexes, since usually a relatively large part of a cube must be accessed in order to calculate aggregated measures. Clustering places data that is likely to be accessed together physically close to each other. The goal of clustering is to limit the number of disk accesses required to process a query by increasing the likelihood that query results have already been cached.
منابع مشابه
Interactive ROLAP on Large Datasets: A Case Study with UB-Trees
Online Analytical Processing (OLAP) requires query response times within the range of a few seconds in order to allow for interactive drilling, slicing, or dicing through an OLAP cube. While small OLAP applications use multidimensional database systems, large OLAP applications like the SAP BW rely on relational (ROLAP) databases for efficient data storage and retrieval. ROLAP databases use spec...
متن کاملProcessing relational OLAP queries with UB-Trees and multidimensional hierarchical clustering
Multidimensional access methods like the UBTree can be used to accelerate almost any query processing operation, if proper query processing algorithms are used: Relational queries or SQL queries consist of restrictions, projections, ordering, grouping and aggregation, and join operations. In the presence of multidimensional restrictions or sorting, multidimensional range query or Tetris algorit...
متن کاملThe Transbase Hypercube RDBMS: Multidimensional Indexing of Relational Tables
Only few multidimensional access methods have made their way into commercial relational DBMS. Even if a RDBMS ships with a multidimensional index, the multidimensional index usually is an add-on like Oracle SDO, which is not integrated into the SQL interpreter, query processor and query optimizer of the DBMS kernel. Our demonstration shows TransBase HyperCube, a commercial RDBMS, whose kernel f...
متن کاملBulk Loading a Data Warehouse Built Upon a UB-Tree
This paper considers the issue of bulk loading large data sets for the UB-Tree, a multidimensional index structure. Especially in dataware housing (DW), data mining and OLAP it is necessary to have efficient bulk loading techniques, because loading occurs not continuously, but only from time to time with usually large data sets. We propose two techniques, one for initial loading, which creates ...
متن کاملExploitation of Pre-sortedness for Sorting in Query Processing: The TempTris-Algorithm for UB-Trees
Bulk loading is used to efficiently build a table or access structure, if a large data set is available at index creation time, e.g., the spool process of a data warehouse or the creation of intermediate results during query processing. In this paper we introduce the TempTris algorithm that creates a multidimensional partitioning from a one-dimensionally sorted stream of tuples. In order to ach...
متن کامل